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(54) System and method for protection of digital works 



(57) A method of protecting a digital work uses a 
blind transformation function to transform an encrypted 
digital work into encrypted presentation data. The orig- 
inator's digital content is protected in its original lorm by 
not being decrypted. This method enables the rendering 
or replay application to process the encrypted document 
into encrypted presentation data without decrypting it 



first. Encrypted presentation data is then decrypted just 
before it is displayed to the user. The blind transforma- 
tion function is a function of the original transformation 
function. For example, the blind transformation function 
may be a polynomial of the original transformation func- 
tion. Alternatively, both the blind transformation function 
and the original transformation function may be any mul- 
tivariate, integer coefficient affine function. 
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Description 
Copyright Notice 

tooon A portion of the disclosure of this patent document contains material which is subject to copyright Paction. 
IhP lovriaht o^.er has no objection to the facsimile reproduction by anyone of the patent document or the patent 
£3! 1 3S« h the Patent and Trademark Office file or records, but otherwise reserves all copynght nghts 
whatsoever. 

Related Application ■ 

[0002] This application is a continuation-in-part application of application no. 09/176,529 filed October 23, 1998. 

Field of the Invention 

,00031 Theinventionrel8testodocumentrightsmanagement,an 

loTs whLh emp,oys a blind transformation to transform encrypted digital works into encrypted presentat.cn data. 
Background of the Invention 

mnndi One of the most important issues impeding the widespread distribution of digital documents or works via 
[0004] One lot the mos P inte i.ectual property rights of content owners dunng the 

electrons "^^J^^^J^ or work8 . Efforts to resolve this problem have been termed -Intellectual 
^V^.^^^^!SrS5i Property Rights Management" ("DPRM"), 'Intellectual Property Man- 
^' < ^Inr(^PM"T WgM^^™^™^^' rn!n. "Digital Rights Management- ("DRM") and -Electrons Copyright Mar, 
aoemen IPM ) BjWt a J £ Management is the underlying issue of ensuring that only authored 

1:171 pertorm opera^ on digital documents or works that they have acquire* Once accessed, the content 

L HiBtributed or used in violation of the content owner's specification of nghts. 
must not f H d,st ;' b "^ r 0 ^"° as tne lerm te used here in, is any unit of inlormation subjed to distribution or transfer, 

' Soni an othe ^ages audio and video dps. and other multimedia presentations. A document may be embodied 
9 P n^H til on oTper as digital data on a storage medium, or in any other known manner on a vanety of media. A 
^wor^ 

digital work , as ^wrm a devjce of fl $oftwafe program 

maintained in a digital torm inai can oe ^, ay ^ w k» an author is u«uallv provided to a publisher, which 

Esa^^ 

SS- ' SUSS low S ofXnTandltghTcIt o, distributing printed materia, have served as deterrent 
U^cSS- nS P^dLn««i. it isfartoo easy tocopy. modify, and redistribute unprotected elec- 
> Orients Accordingly some method of protecting electronic documents is necessary to make R harder to 
Si copy Sm. This Srve as a deterrent to copying, even if I, is still possib.e.lor example, to make hardcopies 

S21^!r2Srrt«^. Unlortunately. I has been widely receded that there is no viable way to 
TnllTuow 'making unauthorized distributions of electronic documents within current general-purpose » corn- 
prevent peopta i from na| compulers , works tations, and other devices connected over 

1 J , lm«2v orivatelv decrypted by authorized users. Cryptographic mechanisms prov,de a bas.c form o protect™ 
I ZeTZ a document distributor to an intended user over a public network, as we» as during 

during doc urner^ from. ma na 9 ement solutions rely on encrypting the digital 

d Ta e nd Si t nested message and dLryption key to the consumer's system. While different 
work and d.s Rutins bot ryP ^ ^ ^ ^ ^ nece£safy t|Qn 

Stable ^ o 71 ctusTe .o de.eauhe protection of the digital work. Considering that current genera -purpc« 
V££ £ consumer operating systems provide little in the way of sophisticated security mechanisms, the threat 
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is both real and obvious. 

[0011] A "secure container" (or simpty an encrypted document) offers a way to keep document contents encrypted 
until a set of authorization conditions are met and some copyright terms are honored (e.g., payment lor use). After the 
various conditions and terms are verified with the document provider, the document is released to the user in clear 
s form/Commercial products such as IBM's Cryptolopes and InterTrust's Digiboxes fall into this category. Clearly, the 
secure container approach provides a solution to protecting the document during delivery over insecure channels, but 
does not provide any mechanism to prevent legitimate users from obtaining the clear document and then using and 
redistributing it in violation of content owners' Intellectual property. 

[0012] Cryptographic mechanisms and secure containers focus on protecting the digital work as it is being transferred 
10 to the authorized user/purchaser. However, a digital work must be protected throughout Its use from malicious users 
and malicious software programs. Even if a user is a trusted individual, the user's system may be susceptible to attack. 
A significant problem facing electronic commerce for digital works is ensuring that the work is protected on the target 
consumer's device. If the protection for the digital work is compromised, valuable and sensitive information is lost. To 
complicate matters, today's general-purpose computers and consumer operating systems are deficient in the areas of 
is security and integrity. Frotecting the work throughout usage is a much more complex issue that remains largely un- 
solved. 

[0013] In the trusted system- approach, the entire system is responsible for preventing unauthorized use and dis- 
tribution of the document. Building a trusted system usually entails introducing new hardware such as a secure proc- 
essor, secure storage and secure rendering devices. This also requires that all software applications that run on trusted 
20 systems be certified to be trusted. While building tamper-proof trusted systems is still a real challenge to existing 
technologies, current market trends suggest that open and untrusted systems such as PC's and workstations will be 
the dominant systems used to access copyrighted documents. In this sense, existing computing environments such 
as PC s and workstations equipped with popular operating systems (e.g., Windows and UNIX) and render applications 
(e.g., Microsoft Word) are not trusted systems and cannot be made trusted without significantly altering their architec- 
ts tures. 

[001 4] Accordingly, although certain trusted components can be deployed, users must continue to rely upon various 
unknown and untrusted elements and systems. On such systems, even if they are expected to be secure, unanticipated 
bugs and weaknesses are frequently found and exploited. 

[0015] Conventional symmetric and asymmetric encryption methods treat messages to be encrypted as basically 
30 binary strings. Applying conventional encryption methods to documents has some drawbacks. Documents are typically 
relatively long messages; encrypting long messages can have a significant impact on the performance of any appli- 
cation that needs to decrypt the oocument prior to use. More importantly, documents are formatted messages that rely 
on appropriate rendering applications to display, play, prim and even edit them. Since encrypting a document generally 
destroys formatting information, most rendering applications require the document be decrypted into clear form before 
35 rendering it. Decryption prior to rendering opens the possibility ot disclcsing the document in the clear after the de- 
cryption step to anyone who wants to intercept It. 

[0016] There are a number of issues in rights management: authentication, authorization, accounting, payment and 
financial clearing, rights specification, rights verification, rights enforcement, and document protection. Document pro- 
tection is a particularly important issue. After a user has honored the rights of the content owner and has been permitted 
40 io perform a particular operation with a document (e.g., print it. view it on-screen, play the music, or execute the 
software), the document is presumably in-the-clear, or unencrypted. Simply stated, the document protection problem 
is to prevent the content owner's rights from being compromised when the document is in its most vulnerable state: 
stored, in the clear, on a machine within the user's control. 

[001 7] Even when a document is securely delivered (typically in encrypted form) from a distributor to the user, It must 
45 be rendered to a presentation data form before the user can view or otherwise manipulate the document. Accordingly, 
to achieve the highest level of protection, it is important to protect ihe document contents as much as possible, while 
revealing Ihem to the user at a laie stage and in a iorm lhai is dillicull to recover into a uselul lorm. 
|00lf • in If* known appu^* ■•*■? " electronic Cecum* r.. » i: im mn-r, ir.si r-.v.r M f.ncryplior.. an enciypted documem 
is rendered in several separate steps. First, the encrypieo oocument is received by the user. Second, the user employs 
50 his private key (in a public key cryptosystem) to decrypt the data and derive the document's clear content. Finally, the 
clear content is then passed on to a rendering application, which translates the computer-readable document into the 
finished document, either for viewing on the user's computer screen or for printing a hardcopy. The clear content is 
required for rendering because, in most cases, the rendering application is a third-party product (such as Microsoft 
Word or Adobe Acrobat Reader) that requires the input document to be in a specific format. It should be appreciated, 
55 then, that between the second and third steps, the previously protected document is vulnerable. It has been decrypted, 
but is still stored in clear electronic form on the user's computer. If the user is careless or is otherwise motivated to 
minimize fees, the document may be easily redistributed without acquiring the necessary permissions from the content 
owner. 
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,nn, oi «/hii» no svsiem is completely spool proof or immune to attack, some recenl techniques protect digital works 
L I Til o?th d'aital work to a user-specified physical device. These techniques require the user to provide 
bV ri^aSn or syst^ ^tate information from the system or physfca. device the user intends to use to render 
pnvate information or system s.ime „_,.-«„ A^med as svstem configuration information such as system 

,ne digital work. System state mformano ns tj«afly J^ ed ^^n etc. In these techniques, the digital 
parameters. CPU 

la.idale usage of the digita. work by <^;^^^pSZX^^ device seria. numbers. 
Among the user credentials are system device ^""^ « the specified device is present. 

to divulge sensitive information (e.g.. ^"^'J^ option if he/she does not wish to 

privacy issues. White the user d,vu«es the 2^5*. to provide a protection scheme that 

distributed document during the deciyption and rendering processes. 
Summary of the Invention 
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may be cryptographically less secure than the encryption used for distribution, but serves to deter casual copying, in 
this embodiment, depolarization is performed during or after the rendering process, so as to cause any intermediate 
form of the document to be essentially unusable. 

[0029] In another embodiment of the invention, a method of protecting a digital work uses a blind transformation 

5 function to transform an encrypted digital work into encrypted presentation data. The originator's digital content is 
protected in its original form by not being decrypted. This method enables the rendering or replay application to process 
the encrypted document into encrypted presentation data without decrypting it first. Encrypted presentation data is 
then decrypted just before it is displayed to the user. This method improves the overall performance of the process 
(both decryption and rendering) by minimizing the decryption overhead (since pre-rendering decryption is generally 

10 more time and resource consuming) and postponing the decryption to a late stage of the rendering process. 

[0030] Blind transformation or blind computing can be accomplished in one of several ways. Most digital works include 
formatting information, which when encrypted cannot be processed by the replay or rendering application (the trans- 
formation function which transforms a digital work into presentation data). If the digital work is encrypted with a format 
preserving encryption scheme, any transformation function may be used. This is particularly useful in that any com- 

is mercial replay or rendering application can process the encrypted digital work into encrypted presentation data. Oth- 
erwise the blind transformation function is a function of the original transformation function. For example, the blind 
transformation function may be a polynomial of the original transformation function. Alternatively, both the blind trans- 
formation function and the original transformation function may be any multivariate, integer coefficient affine function. 
[0031] Not all encryption schemes are format preserving encryption schemes. Additive encryption schemes may be 

20 used with ail document types and all associated transformation functions. In some replay or render applications, for 
some types of documents, portions of the format information may be left in the clear. In other types of documents all 
of the format information may be encrypted. In some types of documents, an additive encryption scheme may be used 
to encrypt the format information and any encryption scheme may be used to encrypt the content or data portion of 

the document. t . 

25 [0032] In particular, additive encryption schemes can be used to encrypt coordinate information of documents so 
that some rendering transformations can be performed on the encrypted coordinate data. In a special class of docu- 
ments token-based documents, for example, there are two places during the format-preserving encryption that use 
encryption schemes: one is for coordinate or location information x and y of the particular tokens within the document, 
and the other is for the dictionary of individual token images. In order to perform blind transformation on the individual 

30 coordinates of the particular tokens in the document, the first encryption scheme must be an additive encryption 
scheme However, the token dictionary may be encrypted with any encryption scheme. 

[0033] An encrypted token dictionary may still leak information such as the sizes of the token images. If this is a 
concern (such as if the token dictionary is small), the tokens can be padded with some extra bits before encryption. 
The padding can result in encrypted token images of a same size or several fixed sizes. For a token-based document, 
35 the coordinate information of the tokens in the dictionary may not be encoded. If it is desired that coordinate information 
be encoded say, as Huffman codewords, the same approach that is used to encrypt the identifiers can be used to deal 
with this situation. Basically, the cooewords in location tables are left in the clear, and the codewords in the codeword 
dictionary are hashed using some one-way hash function and their corresponding coordinate information is encrypted. 
During rendering the codewords in the location tables are first hashed and then used to lookup their encrypted coor- 

40 dinate information. . 

[0034] In another embodiment of the invention, a digital work and a system context (or resource information or system 
resource) are polarized enabling trusted rendering or replay of the digital work without depolarization of the digital 
content In this embodiment, the digital work is of the type which includes digital content and resource information. 
Resource information may include information used by a replay application to format or process the digital work into 

45 presentation data. Resource information may include, for example, a collection of system resources available to the 
replay sonware on a panicular sysiem, such as the Foni Table. Color Paieite, System Coordinaies and Volume Setting. 
|0035) DiHerenl types ol digital works may be polarized, in adowon 10 polarizing typical Document lype digital works, 
ujcu-u* vicifec.c;ioiU.iwi-.n:r«.nhf pc fe.iCfrf. ll* ci^ii;.: win i.i.c rym : ,. . ;.,,u ; . .vim*! ti:.^f gu man 
tacturer or content owner's location using a polarization engine. A polarization engine is a component used to transform 

so the digital work and system context to their respective polarized forms. The polarization engine employs a polarization 
scheme which relies on some polarization seed, an element used to initialize and customize the polarization engine. 
[0036] Various polarization schemes may be used to polarize a digital work. For example, a stateless polarization 
employs a random number as a seed to transform a digital work into a polarized digital work. A state-based polarization 
scheme employs a seed based on a system state or characteristic ol a system to transform a digital work into a polarized 

55 digital work that is associated with that system state or characteristic. A dynamic state-based polarization scheme 
employs a seed based on a dynamic system state or characteristic to transform a digital work into a polarized digital 
work. In this embodiment, the polarized digital work will typically be provided with a polarization engine for repolarizing 
the encoded digital work and the encoded system context according to the dynamic state-based polarization scheme 
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each time the system requests replay of the digital work. An authorization-based polarization scheme employs a seed 
b-sed on authorization information received from a trusted source to transform a digital work into a polarized digital 
work For further security, the polarized system context can be stored separately from the polanzed digital work in a 
removable context device, which must be coupled to the system prior to use of the digital work. 
100371 Preferably the polarization seed contains information which can be used to tie the particular digital work to 
the ultimate end user or an ultimate end user system. Typically the owner or distributor will select the type of polanzation 
scheme to be used in polarizing the digital work and the type of polarization key to use depending on the value of the 
diortal work Like encryption schemes, polarization schemes come in different levels of complexity and strength. When 
a dioital work is ordered, a copy of a portion of the digital work's resource information, called the system context, js 
made The polarization seed is selected and both the digital work and the system context are polarized. A different 
Polarization scheme may be used for the system context than is used for the digital work. However the polanzation 
seed is the same for both. The polarized digital work and polarized system context are then provided to the user for 
renlav or renderina on a replay or rendering system. 

Si In the format preserving encryption and trusted rendering embodiment of the invention, protection is prov,ded 
until the encrypted presentation data must be decrypted into clear presentation data. In this embodiment of the inven- 
Son. the replay application uses the polarized resource information to transform a polarized digrtal work mto clear 

Eow^^ntShe digital content of a digital work is polarized, leaving the resource information unpolarized or in the 
clear the replay application will be able to process the polarized digital work into polarized presentation data. This 
means a depolarizer must depolarize the presentation data into clear presentation data suitable for v,ew.ng or use by 
me user If a portion of a digital work's resource information is also polarized accordingly, when the replay eppl.cat.on 
transforms the polarized digital work, the replay application uses the polarized system resourwinformat^nt^ 
he polarized digital work into clear presentation data. All or just a portion of the required resource '"formation may be 
oolarized The replay is blind in that the replay application does not see the original, unpolarized d.gita content. 
[00401 in this embodiment, a polarized digital work is transformed by the replay application using a polaraed system 
ccmelt (resource information) to create clear presentation data; the replay application can be any commerce. «MM 
pany^ 

enoine is required. The replay application operates as a blind replay system (rt processes polanzed digrtal content 
2 p anza system resources) and relies on a type of polarization which transforms or encodes he djrta. work 
such that the ability to replay ft using a software program or device is lied to a specrfic resource information, thus 

\ZT^Z^TeZy encryption to protect the digital work end eventually decrypt the digital I work 
nto its clear form before the digital work is provided to the replay application, the blind replay system keeps the digrta 
wo* entodedTthe polarized form (there is no explicit decoding step in the blind reply) until the last possib e moment 
S rX process in the blind replay system, the polarized digital work itself is never depolanzed .n the clear. S.nce 
or^ZZ Z * generally of a lesser quality than the original digital work, even if the presentat.on data -scaptured 
in its clear lorm, it cannot be easily (if at all) transformed back into the original digrtal work. 
004H Many different types of digrtal works and their resource information may be polarized and replayed .n a bl nd 
Slay svstem Digfta. worEs such as documents, text, audio files, graphics files and video files may be replayed ,n the 
40 blind replay sysiem of the invention by polarization of an appropriate resource information. 

Brief Description of the Drawings 

[0043] The structure and function of the invention is best understood with reference to the included drawings, which 
is may be described as follows: 

FIGURE 1 is a top-level block diagram representing a model lor the creation and commercial distribution of elec- 
tronic documents in either secure or insecure environments: . ^ 
FIRURF J> i« f flow rlir.orr.r,. illt.r irr.i.np the *mn,ri, t ! r .nr-r.i^ <-i*,-ur. r ,i<- rir r umenlr according 10 the art; 
FIGURE 0 is a now d.ag.an, H.usiraung tne oecrypt.on oi protected electronic oocumenis accoro.ng to e s.mple 
embodiment of the invention; . 
FIGURE 4 is a flow diagram illustrating the decryption of protected electronic documents according to a preferred 

FIGUrHIs a funcSbl^ck diagram illustrating the data structures present in a setf-protecting document ac- 
55 cordino to an embodiment of the invention; 

FIGURE 6 is a flow diagram illustrating the creation and customization of a self-protect.ng document accord.ng lo 

an embodiment of the invention; # 

FIGURE 7 is a flow diagram, from a user's perspective, illustrating the actions performed in handling and using a 
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self-protecting document according to the invention; 

FIGURE 8 is a graph illustrating several posstole paths between an unrendered and encrypted document, and 
rendered and decrypted presentation data; 

FIGURE 9 is a flow diagram illustrating a polarization process according to the invention in which document format 
5 information remains in the clear for rendering. i 

FIGURE 1 0.is a block diagram of a method of format preserving encryption and trusted rendering according to the 
invention; 

FIGURE 11 is a simple example of a document to be tokenized; 
FIGURE 12 is the token dictionary for the document of Fig. 11 ; 
10 FIGURE 13 is the location table for the document of Fig. 11; 

FIGURE 14 is a block diagram illustrating a process for generating a polarized digital work and polarized system 
resource according to the invention; 

FIGURE 15 is a block diagram illustrating the conversion of a digital work into image data according to the art; 
FIGURE 16 is a block diagram illustrating a system for blind replay of a polarized digital work according to the 
15 invention; 

FIGURE 17 is a block diagram illustrating another system of blind replay of a polarized digital work according to 
the invention; 

FIGURE 1 8 is a block diagram of an example structure of a digital document; 
FIGURE 19 is an example digital document; 
20 FIGURE 20 is an example of the digital document of Fig. 16 after it has been polarized; 

FIGURE 21 is block diagram of an example structure of a resource information or system context for a digital 
document; 

FIGURE 22 is a block diagram of an example font table; and 

FIGURE 23 is block diagram of the font table of Fig. 22 after it has been polarized. 

25 

Detailed Description of the Preferred Embodiments 

[0044] The invention is described below, with reference to detailed illustrative embodiments. It will be apparent that 
the invention can be embodied in a wide variety of forms, some of which may be quite different from those of the 
30 disclosed embodiments. Consequently, the specific structural and functional details disclosed herein are merely rep- 
resentative and do not limit the scope of the invention. 

[0045] Figure 1 represents a top-level functional model for a system for the electronic distribution of documents, 
which as defined above, may include correspondence, books, magazines, journals, newspapers, other papers, soft- 
ware, audio and video clips, and other multimedia presentations. 

35 [0046] An author (or publisher) 1 1 0 creates a document's original content 112 and passes it to a distributor 1 14 for 
distribution. Although it is contemplated that the author may also distribute documents directly, without involving another 
parry as a distributor, the division of labor set fcrth in Figure 1 is more efficient, as it allows the author/publisher 110 to 
concentrate on content creation, and not the mechanical and mundane lunctions taken over by the distributor 114. 
Moreover, such a breakdown would allow the distributor 114 to realize economies of scale by associating with a number 

40 of authors and publishers (including the illustrated author/publisher 110). 

[0047] The distributor 1 1 4 then passes modified content 1 1 6 to a user 1 1 8. In a typical electronic distribution model, 
the modified content 116 represents an encrypted version of the original content 112; the distributor 114 encrypts the 
original content 112 with the user 118's public key, and modified content 116 is customized solely for the single user 
1 1 8. The user 1 1 8 is then able to use his private key to decrypt the modified content 1 1 6 and view the original content 

45 112. 

[0048] A payment 1 20 for the content 1 1 2 is passed irorn the user 11 8 to the distributor 1 1 4 by way ol a clearinghouse 
122. Tne clearinghouse 122 collects requests Irom the user 116 and Irom other users who wish to view a particular 
en* UI..H... *j im- ciefcrinpiii i?m vj: us t ;.i.|ii i*u p.- r.a iii inn nr., uu ; : c ■ i i. it.-:*: .miu mi: rn fiii cur! irr.ns - 
tions ; or other known electronic payment schemes, and lorwards the coliecteo users* payments as a payment batch 

50 124 to the distributor 114. Of course, it is expected that the clearinghouse 122 will retain a share of the user's payment 
120. In turn, the distributor 114 retains a portion of the payment batch 124 and forwards a payment 126 (including 
royalties) to the author and publisher 110. In one embodiment of this scheme, the distributor 114 awaits a bundle of 
user requests for a single document before sending anything out. When this is done, a single document with modified 
content 11 6 can be generated for decryption by all of the requesting users. This technique is well-known in the art. 

55 [0049] In the meantime, each time the user 118 requests (or uses) a document, an accounting message 1 28 is sent 
to an audit server 130. The audit server 130 ensures that each request by the user 118 matches with a document sent 
by the distributor 114; accounting information 131 is received by the audit server 130 directly from the distributor 114. 
Any inconsistencies are transmitted via a report 132 to the clearinghouse 122, which can then adjust the payment 



7 



EP 1146 715 A1 



K ,, c15 . marfp t0 the distributor 114. This accounting scheme is present to reduce the possibility of fraud in this 

- we " - » ^ « me - dependem usa9e p * mM that may resutt 

10052] Al 11* oelset an eneiypnM document 210 is reeenneooy ine use n. 
is similar or identical to the original content 112 (Figure 1). constructs presentation data 220. or 

document type. r.ira this The clear content 216 can be copied, 

10054] As discussed above, the document is vulnerable m system £ ^J^J^J orthe author/publisher 
stored or passed along to other users without the Kno^edge or co^entof „ clear „ 

[0056] Figure 3 looks similar to Figure 2, in that an encrypted <^ rt « ^£ tB Yl8 However, an addi- 
which useL private key 314) and a rende^^^^ 

tional layer of protection is prov.ded by a protect ng "^^^SSJoSmI 216 of Figure 2) available to be 
decrypted and rendered without ever leaving clear content (as ' "^.SS^n Ita document 310. as will be 
intercepted. This is accomplished by are adapted to iimi, the 

■ r™w^ 

adapted to secure the document after it has been ^ 

cation 424. As discussed above, typical rendenng ZZTZSo^m not be able to process the 

Adobe Acrobat Reader. However, it is likely that such eternal l ^^^^^ m ^ Hi wiHhave been 
polarized contents 420, as the contents, any formatting codes, and other cues usee oy 

ESDnSS^ 

f 0 t62i%roC^Tt F ;s 
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rendering application 424 but is still polarized, and hence not readable by the user. The polarized presentation data 
426 is passed to a depolarizer 428, which receives the polarization key 418 and restores the original form of the 
document as presentation data 430. In one embodiment of the invention, the depolarization function is combined with 
the rendering or display function. In this case, the polarized presentation data 426 is received directly by a display 

5 device, which can be separate from the user's system and receive data over a communications channel. 

[0063] Creation of the polarization key 418, the rendering application 41 8, and the depolarization step 428 are all 
elements of the protecting shell 422; these are tamperresistant program elements. It is contemplated that all compu- 
tational (or transformation) steps that occur within the protecting shell 422 will use local data only, and will not store 
temporary data to any globally accessible storage medium or memory area; only the explicit results will be exported 

1 0 from the protecting shell 422. This approach will prevent users from easily modifying operating system entry points or 
scavenging system resources so as to intercept and utilize intermediate data. 

[0064] It should be noted that the presentation data 430 of Figure 4, in alternative embodiments of the invention, can 
be either device independent or device dependent. In the device-independent case, additional processing by a device 
driver (such as a display driver or a printer driver) typically is necessary to complete the rendering process. In the 
is presently preferred device-dependent case, the device-specific modifications to the presentation data have already 
been made (either in the rendering application 424 or the depolarizing step 428), and the presentation data 430 can 
be sent directly to the desired output device. 

[0065] The decryption schemes described with reference to Figures 3 and 4 above are enabled by a unique document 
structure, which is shown in detail in Figure 5. As discussed above, certain operations performed by the system and 
20 method of the invention require trusted components. One way to ensure that certain unmodified code is being used to 
perform the trusted aspects of the invention is to provide the code along with the documents. The various components 
of a self-protecting document according to the invention are illustrated in Figure 5. 

[0066] The problem of document protection is approached by the invention without any assumptions on the presence 
of trusted hardware units or software modules in the user's system. This is accomplished by enhancing a document 

2S to be an active meta-document object. Content owners (i.e., authors or publishers) attach rights to a document that 
specify the types of uses, the necessary authorizations and the associated lees, and a software module that enforces 
the permissions granted to the user. This combination of the document, the associated rights, and the attached software 
modules that enforce the rights is the self-protecting document ("SPD") of the invention. A self-protecting document 
prevents the unauthorized and uncontrolled use and distribution of the document, thereby protecting the rights of the 

30 content owners. 

[0067] The self-protecting document 510 includes three major functional segments: an executable code segment 
512 contains certain portions of executable code necessary to enable the user to use the encrypted document; a rights 
and permissions segment 514 contains data structures representative of the various levels of access that are to be 
permitted to various users; and a content segment 516 includes the encrypted content 116 (Figure 1) sought to be 
35 viewed by the user. 

[0068] In a preferred embodiment of the invention, the content segment 51 6 of the SPD 510 includes three subsec- 
tions : Document meta-information 518 (including but not limited to the document's title, format, and revision date), rights 
label information 520 (such as a copyright notice attached to the text ? as well as rights and permissions information), 
and the protected content 520 (the encrypted document itself). 

40 [0069] in one embodiment of the invention, the rights and permissions segment 514 includes information on each 
authorized user's specific rights. A list of terms and conditions may be attached to each usage right. For example, user 
John Doe may be given the right to view a particular document and to print it twice, at a cost of $10. In this case, the 
rights ana permissions segment 514 identities John Doe, associates two rights with him (a viewing right and a printing 
right), and specifies terms and conditions including the price ($10) and a limitation on printing (twice). The rights and 

45 permissions segment 514 may also include information on other users. 

[0070] in an alternative embodiment tne rights and permissions segment 514 includes only a link to external infor- 
mation specilying rights inlormaiion. in sucn acase.. Uie aciual rights ano permissions are stoied elsewhere, lor example 
i,r.LiiCiwoiKeu|»errriitsK i.ttrivn. wi.irl.i..us ii * ribC.ft.vi.iinv- ih* n- ... i:fc-Oi.n epr^iLrr.pu'vicift! 

the aoveniage that rights an<S permissions may be updated Dynamically Dy the conient owners. For example, tne price 

so for a view may be increased, or a users rights may be terminated i1 unauthorized use has been detected. 

[0071] In either scenario, the rights and permissions segment 514 is cryptographically signed (by methods known 
in the art) to prevent tampering with the specified rights and permissions; it may also be encrypted to prevent the user 
from directly viewing the rights and permissions of himself and others. 

[0072] The executable code segment 512, also called the "SPD Control," also contains several subsections, each 
55 of which comprises a software module at least partially within the executable code segment. In one embodiment of 
the invention, the Java programming language is used for the SPD Control; however, it is contemplated that any plat- 
form-independent or platform-specific language, either interpreted or compiled, can be used in an implementation of 
this invention. 
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[0073, A rights enforcer 524 is present to verHy the to JJjJ™- SfiiS 

l and polarize the data according to he 'g^^^^^^ScSL before it is stored or decrypted, so 
embodiment of the invention, the pclanzafon f^Jf^J^Sfe^n engine 526 is secured, that is. it is 
the document is never stored in the clear on the "^^^J^- and disassembling, 
cryptograph^ signed and encrypte ^ 

10075] A counterpart depolanzat.on engine 528 « also ^ nc.uo ^ ^ pjmmg 

from the polarized content (see F.gure 4). The depolar zauon engm ^ secure 

a relatively tamper-proof interface to r «^£^^dng ihe^MSiWtty that the document. In its clear town, 

window objects are resistant to be.ng operating system, 

can be reconstructed by '^^^^^^ enable itineration of clear presentation data 
[0076] A counterpart depolanzat.cn "^jV^tin. 528 provides a relatively tamper-proof interface 
from the polarsed content (see F.gure J™^J^S^2*«J. The input to the depolarization engine 528 Is 

10 r rrrar^ ■ - * — any * tne c,ear con,ent ww,out 

[0077] Asecurevtewer530isop..ona..y "^^.^.SliS^Srtl^Io th« rigM. and p-nH«ion. «gm«« S14, 
umO to perm« on,y those levels of (and not to save or print tt). the viewer will 

^pe^^ ostmodemoperating 

r?oT>ina,M -daring eng,ne532.s,n.,u^ 

Log engine 532 need not be r^£^£^^Jm^ h, either case, the rendering engine 

specification 614: and an optional watermark ; 616. as desjred by , he author or publ i S her. 

[0081] The content 612 is pre-processed (step 61 8) to lay out in fe essentja|ly . pre . ren . 

For Sample, a preferred page size, font, and ^ J^SSSt compatible with users' systems and the 
dered" in the content pre-process.ng step so that n w.« be in a tomanmn p op) ^ 

SPD For example, the content 612 may be converted from Mcr osoft wore i .u *l embodiment of the 

foTdffleren, Jmat special* adapted to be read ^^y^^J^SX - — in «» 
invention, multiple versions of the content 612 are ' £y the user according to his needs, 

generic SPD 610: those dfflerent vers.ons may then JV^^X!-*" of access rights are permissible. Such a 
[0082] The high-level rights *P«« fc *'^^ 
rights specificat.onistailoredtoapam^ 

classes of downstream users. For '^^^SSw S2 00 royalty. Similarly, usere may be gtven the 
opt.on to pu.cr.8se a vers.on ol the document ina 

imitations are oescribed with reference to ad eta led exampfc «mr lor di g rta . works. It 

,ua.:-, U.on,t .-.openv i . .-..u.oe (DPS , « * ^^e^e* and entorced for rights. Rights speci- 
provioes a mecr.an.sm in which ^^£"£££Z ^ examp.e, U.S. Patent No. 5.715.403 to Stefik. 
fications are represented as statements ,n DP^/"^^j Having Attached Usage Rights Where the 

w«h rights is performed usin Jthe 0 , . digital work using a "work- specrffcation. Within a work 

[0084] Diffe.ent rights can ^^ZTrXs^Ve specified. Rights can be grouped into named-groups 
specification, diflerent sets of ^^^^^^J^^ a set of conditions. Conditions can be of 
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can be performed, and so on. DFRL allows different categories of rights: transfer, render rights, derivative work rights, 
file management rights and configuration rights. Transport rights govern the movement of a work from one repository 
to another Render rights govern the printing and display of a work, or more generally, the transmission of a work 
through a transducer to an external medium (this includes the "export" right, which can be, used to make copies in the 
clear) Derivative work rights govern the reuse of a work in creating new works. File management rights govern making 
and restoring backup copies. Finally, configuration rights refer to the installation of software in repositories. 
An exemplary work specification in DPRL is set forth below: 

(Work: 

(Rights-Language- Version: 1.02) 

(Work-ID: "ISDN-1-55860-166-X; AAP-2348957tut") 

(Description: •Title: 4 Zuke-Zack, the Moby Dog Story' 

Author. 'John Beagle 9 

Copyright 1994 Jones Publishing") 
(Owner (Certificate: 

(Authority: 'library of Congress") 
(ID: "Murphy Publishers"))) 
(Parts: "Pboto-Celebsbots-Dogs-23487gfj" "Dog-Breeds-Chart-AKC") 
(Comment: "Rights edited by Pete Jones, June 1996.") 
(Contents: (From 1) (To: 16636)) 
(Rights-Group: "Regular" 

(Comment- 'This set of rights is used for standard retail editions.") 
(Bundle: 

(Time: (Until: 1998/01/01 0:01)) 

(Fee: (To: "Jones-PBLSH-18546789"XHouse: "Visa"))) 

(Play: 

(Fee: (Metered: (Rate: 1.00 USD) (Pen 1:0:0) (By: 0:0:1)))) 

(Print: 

(Fee: (Per-Use: 1 0.00 USD)) 

(liliiU: 

(Certificate: 

(Authority: "DPT" 

(Type: TnistedPrinter-6"))) 

(Watermark: 
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(Watennark-Str. "Title: 4 Zeke Zack - the Moby Dog' Copyright 
1994 by Zeke Jones. All Rights Reserved.") 
(Watermark-Tokens: user-id institution-location render-name 
render-time)))) 

10 (Transfer.) » 

(Copy.CFtt- (Per-Use: 10.00 USD))) 

(Copy: (Access: 
is (User. (Certificate: 

(Authority. "Murphy Publishers") 

(Type: "Distributor"))))) 

(Delete:) 
(Backup:) 

(Restore: (Fee: (Per-Usc: 5.00 USD))))) 



25 



30 
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45 



r 0 0851 This work specification has a rights group caBed -Regular," which specif.es rights for standard Irel tall . NDjtoM 
ik ^ ,» L "2uke-Zack the Moby Dog Story' The work specification expresses condrtrons for several nghts: play 
Ztll' coT*^: ISSS- -store. The wort in the example inc.udes two other parts, a photograph and 
?£an7*e£™corpora* from other sources. A 'bundle" spec«ication bundles a set of common cond. rons that 
Lolv to alf noma in the group. This specification states that all rights in the group are valid unt.l January 1 . 1998 and 
mauhe fe should be paid to account "Jones-PBLSH-i 8546789". The clearing-house for this ransachon ^houW be 

I .JIh 7h! work can be printed on TrustedPrinter-6 which is certified by "DPT" for a fee of $10.00 per pnnt, the 
f T^t liS Ea« a watelark string (as depicted) and a list of tokens signifying "fingerprint" information known 

lihy^bfishing; and unrestricted transfer, deletion or backing up of this ^j^SS^^^ 
IO0861 The hiah-level rights specification 61 4 is also subject to a pre-processing step (step 620 . n wheh »e nijn 
S,. human "readable) specrfication is compiled into a more-efficient data structure represents for use by the 



00871 The oeneric SPD 61 0 is then created (step 622) by combining the pre-processed content 612, the ,p e-proo. 
Srinhts soecLation 614, and the watermark 616. A watermark may be added by any means known m the art « 
ma within the SPD. The generic SPD 610 may a.so optional be encrypted by the 
anthnr/nuhlisher 11 0 for transmission to the distributor 114 (Figure 1). 

moS The oenei SPD 610 is then received by the distributor 114. and is stored for later When a 

uTreouest 624 ts received by the distributor 114 (either directly or through the clearinghouse 122 or other interme. 
Z, the sutou" "4 creates a set ot user permissions (step 626) .hat is consistent with both the user request 624 
Z Mhe r^Sication 614. .. tnere is no such consistent se, ot permissions, tnen no further actron ,s performed 

632 adap ed to be used by the user. The user permissions from step 626 are stored m he 
SL G nt 514 o° the SPD 632, and the user's public key 628 is used to encrypt the content .n the content aegment 616 
of the SPD 632 A pubHc-key encryption mechanism can be used to transform the SPD from the genenc form to the 
customized S^PD 632. Such a melanism is useful it the SPD has to be confidential* transfe ^d between^ flerent 
tri„ author xo publisher to retailerto consumer, with rights protection at each stage. It should further be noted 

!nTX are Z^Tlg public keys to encrypt a document such that any of the users' pnvate keys 

can be used to decrypt it. 
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[0090] The resulting custom SPD 632 is then transmitted to the user 118 by any available means, such as via a 
computer network or stored on a physical medium (such as a magnetic or optical disk). 

[0091] The operations performed when a user receives an SPD are depicted in the flow diagram of Figure 7. The 
SPD is first received and stored at the user's system (step 710); in many cases, it is not, necessary to use the SPD 

5 right away. When usage is desired, the user is first authenticated (step 712), typically with a user name and a password 
or key. The system then determines what action is desired by the user (step 71 4). When an action is chosen, the rights- 
enforcement step of the invention (step 716) verifies the conditions associated with the desired action (such as the 
fee, time, level of access, watermark, or other conditions); this can be performed locally via the SPD applet 51 2 (Figure 
5) or by accessing a rights enforcement server. 

10 [0092] If the rights enforcement step (step 71 6) fails, an update procedure (step 71 8) is undertaken. The user may 
choose to update his permissions, for example by authorizing additional fees. After the satisfactory verification of con- 
ditions, a pre-audit procedure (step 71 B) is performed, in which the SPD system logs verification status to a tracking 
service (e.g., the audit server 130 of Figure 1). The content is then securely rendered to the screen (step 722) as 
discussed above. When the user is finished, a post-audit procedure (step 724) is performed in which the amount of 

75 usage is updated with the tracking service. The SPD system then awaits further action. 

[0093] The protection yielded by the SPD is derived from the user's inability to capture a useful form of the document 
at any intermediate stage during the rendering process. This is accomplished by Decrypting the document contents to 
a clear form at the latest possible stage, ideally In the last step. 

[0094] The SPD decryption model is illustrated in Figure 8. E denotes the encryption function performed by the 
20 publisher D denotes the decryption performed at the user's system, and R denotes the rendering transformation. Many 
prior systems use a first sequence of transformations 810, D(E(x)) followed by R(D(E(x))). As stated previously, the 
early decryption leaves the document in a vulnerable state. Ideally, the transformations are performed in the reverse 
order 812, R'(EM) followed by D(R'(E(x))). This postpones decryption to the latest possible time. 
[0095] The existence of R\ a rendering operation that can be performed belore decryption, is determined by the 
25 . following equality: 

D(R'(E(x)))=R(D(E(x))) 

30 In case that the encryption and decryption functions are commutative, that is. E(D(x)) = D(E(x)) for any x, the existence 
of R' is ensured: 
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R' (y) = E(R(D(y))) fory = E(x) 



In practice, encryption and decryption functions in popular public-key cryptographic systems such as the RSA system 
and EiGamal discrete logarithm system satisfy the commutation requirement. This means that the transformation R' 
exists if these cryptographic systems are used for encryption and decryption. 

[0096] The path x' = D(R'(E(x))) portrays an ideal SPD solution to the document protection against unauthori2ed 
40 document usage and distribution. A scenario of distributing and using a cocument can be described as follows. When 
a user purchases the document, the document is encrypted using a user's public information and is transmitted over 
an insecure network channel such as the Internet. The encrypted document has the rights information attached to It 
and a protecting applet 512 that enforces the rights and permissions granted to the user by the content owner. Upon 
a user's request on using the document, the applet verifies the rights and permissions and generates from the encrypted 
45 document the presentation format of the original document. As any intermediate form of the document before the final 
presenlalion data is encrypted with the user's private inlormalion : the SPD model ot document protection ensures that 
any imermetiiaie lorm ot the document 15 not useful to other systems wnerever it is intercepted. 
lUUS:^ Cieariy. this ioe^i iv.coh rents or, whetm-i l: hi . irtr.f ioir,.r.»n.i. I •' m;:i corresponds to the renaerir.c. 
translormation R can be comfJuted efficiently, and in particular on wheiner or not an invocation ot the Decryption function 
so D is^necessary during an implementation of R\ A trivial case in which R' can be implemented efficiently is where R is 
commutative with the encryption function E. When this happens, 



R l (y) = E(R(D(y))) = R(E(D(y))) = R(y) 

for y = E(x). In this case, R' = R. 
[0098] Consideration of Figure 8 reveals tnat many intermediate solutions (e.g., intermediate solutions 814, 816, 
and 818) to the document protection problem may exist on the user's system between the two extremes x* = R(D(E 
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(x))), which has no protection on x = D(E(x)), and x* = D(fi'(E(x))), which has ideal protection (under the assumptions 
set forth above). As depicted in Figure 8. one may consider different paths from the encrypted document E(x) to the 
presentation data x' that correspond to different combinations of partial rendering transformations and partial decryption 
transformations. Again, it should be recognized that delaying the decryption D in any path increases the protection 

5 level to the document. • . , 

10099] As discussed above, one alternative method of delaying decryption to the last possible moment employs a 
polarization technique that enciypts only the document contents, not the format or the entire document as a whole. 
This possibility is shown in Figure 9. Beginning with the clear document content 910 (which, it should be noted, does 
not exist in any single identifiable location during the user's processing, but is rather a transient state occurring within 

io step 41 2 of Fioure 4), the document is split (step 912) into a data portion 914 and a format portion 91 6. The data portion 
914 is polarized (step 918) using the polarization key 920 and merged (step 922) with the clear format portion 916. 
This results in polarized content 924 that can be rendered to polarized presentation data without first decrypting the 
content It should be observed that this form of polarization is likely less secure than wholesale encryption with the 
polarization key, since a lot of information can potentially be derived from the layout of a document, word lengths, line 

is lengths etc • however, this scheme will present a uselul deterrent to casual copyright infringement. 

[01001 ' A method of protecting a digital work during replay which employs a blind transformation function is shown 
with reference to Figure 10. In Figure 10, an encrypted digital wortc 1010 is provided to replay application 1012. Digital 
work 1 01 0 has been encrypted with a format preserving encryption scheme which enables replay application 1012 to 
generate encrypted presentation data 1 01 6. Encrypted presentation data 1 01 6 is then sent to decryption engine 1 01 8 

zo where it is decrypted into clear presentation data 1020. Presentation data is now in the clear, but less l.kely to be 
regenerated into the original digital form. If presentation data 1020 can be viewed or used directty by the user, then no 
further processing is required. However, sometimes an additional rendering is required by a display system such as a 
printer In such a case, presentation data 1020 is provided to the display system's rendering application (in the case 
of a printer this could be a decomposer) 1022 which generated image data 1024. Image data 1024 is then prov.ded 

25 to display device 1026. 

[0101] in a general context, the problem of blind transformation can be stated as follows. Suppose a dent Cathy 
wants a server Steve to compute for her a function value F(a,x) with his (public or private) data a and her pnvate data 
x and Cathy wishes for privacy concerns, that the transformation is done without Sieve knowing her pnvate data x 
and the function value F(a,x). From Steve's point of view, this means that he computes F(a,x) for Cathy but with his 

so eyes blindfolded What this means is that Cathy would like the server Steve to perform the transformation only with 
data E„(x) encrypted using Cathy's key k. and return to her the function value E k (F(a.x)) again encrypted using her key 
k If Steve can perform the transformation using encrypted data, then Cathy has avoided disclosing the data x in the 
clear and the result F(a,x) in the clear. The ideal model of blind transformation with partially encrypted data is shown 



35 



below: 



(a,x)— &-»(fl,£(x)) 

Fi ir 

40 F{atX )<— r -F\a,E(x)) 



i- 



The function F' mat makes the diagram commute is what Steve really computes, and the transformation result F'(a,E„ 
(x)) = E k (F(a x)) is ready tor decryption to reveal the desired function value F(a.x). As Steve does not "see- the clear 
45 data x as we'll as the function value F(a.x), he carries out a -blind" transformation for Cathy. 

[01 02] A orotocoi for blind transformation can be described as follows for the blind evaluation of the function F(a,x,: 

(i) Cairiv encrypts x usinp her encryption key k ; resulting E h (x). 

lii' (..mi,- r<-r.fls t,.i:^t< ir> < . 
h (iii) Sieve ev&iuaiei the modilieo version I-' ol the lunction I- at li.e cifcfai oaifa fa ai.o encrypieo oata fc»ix). 

(iv) Steve returns the result F'(a,E k (x)) back to Cathy. 

(v) Catny decrypts F'(a,E k (x)) using her decryption key Ir 1 and obtains F(a,x). 

[0103] The ideal model of blind transformation introduced here can be regarded as a generalization of blind signatures 
55 and instance hiding. Blind transformation now allows partially encrypted data as input and, more importantly, it permits 
the function F' that the server computes to be possibly different from the intended function F. By computing P instead 
of F the server though still blindfolded, is aware of the input being partially encrypted and hence is cooperative with 
the client The blind transformation and secure mobile computing share a common goal in keeping the funclion value 
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that the server computes private to the client, but they differ in that the client supplies the data input and the server 
supplies (a program that evaluates) the function in blind transformation, while it is the other way around in secure 
mobile computing. Note that blind transformation allows some portion of the data (e.g., a) to be in clear This enables 
use of some dynamic yet clear data in the rendering process, such as display window size, reference positions for 
shifting content, scaling factor and coefficients in a rotation operation. , 

[0104J Blind transformation works only if there exist functions F and P to compute the encrypted data. It can be 
shown that multivariate, integer coefficient affine functions using additive encryption schemes permit many document 
rendering functions of the affine type on the x- and y-coordinates to be evaluated in blind transformation. For a given 
encryption scheme S, a function F: X -» X is said to be S-btinOy computable if there exists some function F : X -* X 
such that the computational complexity for evaluating P is a polynomial of the one for evaluating F, and 

F(a l x) = D M (P(a t E k (x))) 

for any k 6 K and x E X. A function F: X -> X is said to be blindly computable if there exists an encryption scheme S 
with X being a subset of its message space such that F is S-blindly computable. 

[01 05] Any multivariate, integer-coefficient affine function is S-blindly computable for any additive encryption scheme. 
Specifically, let 

k 

be a multivariate affine function with a constant Xq 6 X, integer coefficients 3j and variables x v ... x k in X. Then, for 
any key k£K, there exists a computationally efficient function 



such that 



Indeed, the constant y 0 and integer coefficients b, in F^ ^ can be taken to be y 0 = E k (x 0 ), b, = a,, i= 1 k. The 

blind transformation of multivariate, integer coefficient affine functions using additive encryption schemes allows many 
document rendering functions of the affine type on the x-and y-coordinates to be evaluated in the blind manner, pro- 
viding a theoretical foundation tor the format-preserving encryption and trusted rendering of documents described 
herein. 

[01 06J A document is usually a message that conforms to a certain format. For document encryption, in addition to 
simply encrypnno me entire document, there are many different ways lo encrypt only some pans of the document. The 
poal here is thaMhe inlormaiion leakape aboul me unencrypted portion cannol iw iiFed. or il il does leak, il iscompu- 

tCUl-liLlO liilik'JUll U ItVOOliS If t!C; It if- CK-.o.. M H.ll'ifci i. < >. illVu- ; 

10107] II an encryption scneme which preserves lormatting information ot tne Digital work, then any transiormation 
function (replay application or rendering application) may be used. An example of a format preserving encryption meth- 
od is described for convenience with reference to token-based documents. The method for format-preserving encryp- 
tion can be easily extended or applied to documents in other formats (such as HTML/XML, Microsoft WORD, Acrobat 
PDF etc.). In a token-based format such as the Xerox DigiFaper, each page image of a document is represented as 
a 'dictionary" of token images (such as characters and graphics elements) and location information (indicating where 
those token images appear in the page). Thus, multiple occurrences of the same token in the document can be rep- 
resented using just a single image of that token in the dictionary. 

[01 08J The process of rendering a document in such a format is then accomplished by consecutively reading in token 
locations, retrieving images of the tokens from the dictionary and drawing the images at the specified locations. The 
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benefits of token-based documents are compact file si2e and fast rendering speed for use in distributing, viewing and 
orintina of electronic documents. In the DigiPaper format, tokens are stored as binary images using the CCITT Group 
4 compression format, or as color images using JPEG compression, and the position information of the tokens is further 

?oTo P 9i eSS Fo d r convenience 8 awten-based document D of P pages is formally modeled as a table (dictionary) of tokens 
T of «ize HI together with a sequence of P tables of locations of size IL^I (1 s i S P), representing the P page images. 
Each entry TOM < j * HI. is a pair (idljl.tffl) of an identifier idffl and an image tffl of the j-th token. Each entry LJk], IS 
k s iu in the i-th'image location table L, is a triple (id[k). X [k].y[k]) representing the k-th token occurrence in the ,-th 
Pagetag where id[k) is the token identifier, and x[k]. y[k) are its x- and y-coordinat • *2 
fk1)-th Sen occurrence in the page. Forexample.takethesimpledocumentshown^Rgurell.Thetokendctionanr 
LJ Lr-Z table (usinq x y coordinates) for this document are shown in Figures 1 2 anb 13 respectively. 
SST^SLffitliJS^ Render(D) below shows how page images of a document D are rendered In 
" code 1 o arT a he base references for the x- and y-coordinates for each page. LookupfT.idM) is a subroutme 
„at uSoMh'e k£ of the dictionary T and a token identifier idfk], returns a token image t n T corresponding to the 
given identifier, and Dr6w(x,y,t) is a subroutine mat draws the token image t at the location (x.y). 

Render(D) 
{ 

Load T into memory 
fori=ltoPdo 

{ 

Load Lj bto memory 
x = x© 

y=yo 

fork=l toLldo 
{ 

x = x + x[k) 
ysy+y[k] 
tsLookup(T4d[k]) 
Draw(x,y,t) 
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101111 in addition to me shitt.ng uanslormation X = x + a. y = y ♦ b as used in the schematic rendering process 
H«lrLd above Lie are several other coordinate transformations that may occur during the document rendering. 
01lT ScaS' The so 8 „g transfonmation is of the form x" = ax. / = by. where a and b are scaHng factors for the x- 
coordLtf^fy-coordinate respective^. Scaling may be caused by resizing the display window or pnnt paper. 
[01 1 3] Rotation. The rotatiqn transformation is 
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for some consianis a, b, c, d, which form a 2-by-2 rotation matrix. This transformation is needed when the page image 

[oi^ te d Affine Transformation. An affine transformation is one of the form x = ax + by + e; y = cx + dy + f for some 
constants a, b, c r d, e, f. In the vector form, it is: 



Clearly shitting, scaling and rotation transformations are special cases of affine transformations. It is those affine type 
transformations that make it possible to achieve a high-level trusted rendering under encryption of coordinate infor- 
mation using additive encryption schemes described below. 
20 [0115] A special class of encryption schemes, namely, additive encryption schemes, are used to carry out blind 
transformation of functions of the affine type, which provides a foundation for trusted rendering of documents. Blind 
transformation by a rendering transformation R and R' of an encrypted document satisfies the relationship: D(R'(E(x))) 
= R(D(E(x))), where E is an encryption function and D is a decryption function for E. If E(x) is an additive encryption 
scheme, then R' - R. 

25 [0116] An encryption scheme S generally consists of basically five components: (i) a message space X which is a 
collection of possible messages, (ii) a ciphertext space Y which is a collection of possible encrypted messages. (Hi) a 
key space K which is a set of possible keys, (iv) a computationally efficient encryption function E:Kx X-> Vand (v) 
a computationally efficient decryption function D : K x Y-* X . For each key k G.K. there is a unique key Id G K. such 
that the encryption function E k = E(k.) : X-> Y and decryption function O,' 1 = W\) : Y-* X satisfy that, for every 

30 message x e X, D* 1 < E * W) s * The kev k is cal,ed £n encrv P tion ke V and k " 1 rts corresponding decryption key. 

[01 17] Such defined encryption schemes can be varied in several ways to cover a wide range of concrete encryption 
schemes used in practice. One variation is to consider whether or not keys used for encryption and decryption are 
different In the case where all encryption keys k are same as their corresponding decryption keys Ic 1 . the scheme is 
a symmetric (or private-key) one; otherwise, the scheme is asymmetric. In the case wnere, for all possible k, Ic is 

35 different from k and computationally difficult to derive from k. the scheme is a public-key encryption scheme. 

[0118] Another variation is to differentiate deterministic and probabilistic encryption schemes. In a deterministic 
scheme all the encryption and decryption functions E k and D w are deterministic functions, while in a probabilistic 
scheme'the encryption function E k can be non-deterministic, namely, applying the function to a message twice may 
result in two different encrypted messages. 

40 [0119] An additive encryption scheme is an encryption scheme whose message space X and ciphertext space Y 
possess some additive structures and encryption function E k = £(*.) : X^ Y is homomorphic with respect to the additive 
structures Specifically, let X = (X, +, 0) and Y = (Y.©,0) be two commutative semigroups with (possibly different) zero 
elements 0 satisfying, tor example, for all x, x + 0 = x and 0 + x = x, and efficient operations + and ©. An encryption 
scheme is said to be additive it, for any k G K and any x, x' G X, E k (x + x') = E k (x) © E k M, and the operation © does 

45 not reveal the clear messages x and x\ The last condition on © makes additive encryption schemes non-tnvial. Without 
this condition, me operation © on Y can be trivially detmed y © y' = E l; (D^(y) + D fc . ,(/)); that is : ii is accomplished by 
first decrypting ine aroumenif. ihen addinp Ihem lopempi and finally rp-pncrvpiinp ihe result 

li.U-Uj uoseivK ir.UH ;< ^.im-i # r.ciyr-licr. <.ci.trv.i-: uliir .u .mi , » ... . n,crypm.n scheme is said 10 t* 

multiplicative \\ its spaces x'and Y have the ring structures (i.e., in addition to their additive structures, they nave 
so respective multiplications x and <8> that are distributive over their additions + and 6, and multiplicative identities), the 
encryption function E k is homomorphic with respect to the multiplications. E k (x x W) = E k (x) ® E k (x»); and the operation 
® does not reveal the clear messages x and x\ 

[0121] In general, additive (as well as multiplicative) encryption schemes are not non-malleable, since a non-malle- 
able scheme requires that, given an encrypted message it is (at least computationally) impossible to generate a different 
55 encrypted message so that the respective clear messages are related. Accordingly, they have a weakness against 
active attacks where the adversary attempts to delete, add or alter in some other way the encrypted messages. How- 
ever, when these schemes are used to encrypt documents, extra measures in data integrity and message authentication 
can be taken to reduce risks caused by these active attacks on document integrity as well as confidentiality. Moreover, 
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end users are less motivated to initiate active attacks, as the attacks will affect document contents that the users are 
aoinq to use and consume. 

F01221 Not all encryption schemes can be defined as additive ones in an easy and natural manner. In fact, some 
encryption schemes are designed with a requirement of being non-additive or at least being able to convert into non- 
5 additive Nevertheless, there are many examples of additive encryption schemes that can be used in the method o 
format-preserving encryption and trusted document rendering. Mutt, Exp and EG (three determmistc schemes), OU 
(probabilistic) and RSA are examples of additive encryption schemes (with varying degrees of vulnerab.lrty to attack) 
may be used in the format preserving method. 

[0123] Multiplicative Cipher (Mult) is a symmetric encryption scheme, where X = Y = 7^ = (0, 1, ... n-i) tor some 
io integer n>0. The encryption of a message x using a key a is , 

y = E a (x) = ax(mod n) 
is and the decryption of a message y using a key a is 

x = D a (y) = i\{mod n), 

zo where a* 1 is the multiplicative inverse of a modulo n. . 

Toi24] Exponential Cipher (Exp) is a symmetric cipher, where X = Z,., and the cphertext space Y = Z, £r some 
prime p, and K is the set of all generators of the multiplcative group Z* p . For any generator g 6 K, the encrypfon 
function is defined as the exponential function 

25 E s (x)=g , (modp), 

while the decryption function is defined as the logarithm function 
so D g (y) = iog g y(mod(p-l)). 

[0125] Semi-probabilistic EIGamal Cipher (EG) extends the exponential cipher to the EIGamal cipher which leads 
he EIGamal cipher to run in a semi-probabilistic mode. For each message x € ^ where fcV^J£™Z 
prime p, g is a generator in the multiplicative group r p , the private decryption key for a user is a randorr .number e6 
^ the public encryption key a = gB(mod p) G Zp, the encryption E a (x, r) depends on a unrtormty chosen random 
number r e Z'p.v 

E a (x,r) = (g r (mod p). xa r (mod p)) = (s,t). 
For an enciypled message (s, t), the decryption function is defined as 

D 0 (s.t) = t(s Q )" 1 (modp). 

101 261 The EIGamal cioher in its original lorm as described above is hardly additive. However, the operator e can 
t>e partially defined on me cipr.enexi ol those xs that snare e same random numbe. r, as follows : 

E a (x, r) e E„ (x', r) = (s, l) © (s, f) = (s, t + f) = E„ (x + x'(mod p), r). 

[0127] This partially defined operation is applfcable wher, a batch of messages are encrypled using* same random 

ISi^ Okamoto-Uchiyamai Cipher (OU). Okamoto and Uchiyama proposed an additive, public-key encryption 
cheme in T otmo o and S. Uchiyama. A New Public-Key Cryptosystem as Secure as Factoring". ****** 
LectuTe Notes in Computer Science 1403. 308-318, 1998. which is probabilistic and provabiy as secure as the^rac- 
\SS^XSSi n . p*q against passive adversaries. Choose two large primes p. q of k bits for some k > O.and let 
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n - p^q Choose g E Z\ at random such that the order of 9p = '(mod p 2 ) is p. Let h = g" (mod n). The message 
space X of the OU scheme is the set T p (notthe set {1 ...^ as claimed by Ckamoio and Uchiyama) and the ciphertext 
space Y is Z For a user, a public key is a tuple (n, g, h, k) and its corresponding private key is the pair (p, q) of the 
primes. To encrypt a message x € X, a random number r € 2 n is chosen uniformly. Then,the encrypted message is 

t 

y s E,„, 9W) (x,r) = g"li'lmodn). 
To decrypt the encrypted message y, a "logarithmic" function L:r -* I", 

L(x) = (x - 1)p*' (modp 2 ) 



is used, where r is the p-Sylow subgroup of 2%* i.e., r = {x 6 Z*pj I x ■ 1 (mod p)J. With the function L, the decryption 
is function is 

x = D Piq (y) = Iff - (mod p 2 ))!^,)" 1 (mod p 2 ). 

20 [0129] New additive enciyption schemes can be constructed from existing ones vie the composition construction of 
encryption schemes. The composition construction can also be used to construct additive encryption schemes from 
non-additive ones. For instance, the composition of the exponential cipher Exp and any multipl.cat.ve encryption 
scheme S (such as RSA) results in an additive one. 

10130] Additive encryption schemes enable blind transformation with partially encrypted data, which serves a foun- 
dation if or trusted rendering of documents, as discussed above. In particular, additive encryption schemes can be used 
to perform blind transformation of affine functions with clear coefficients and encrypted variables. 
r0131] Returning to the example of a token-based document, since a token-based document D cons.sts of a dictionary 
T of token images and a sequence of location tables L, (one for each page image), the idea is to encrypt the content 
of the dictionary T and location tables L,, resulting in a dictionary T of encrypted token images and tables L', of encrypted 
locations Recall that the dictionary T consists of a collection of pairs (kffl tffl). j = 1 . ... m. Associated with T is a 
subroutine Lookup in the rendering process that, given a valid token identifier id, returns its correspond.ng token image 
t in T. In encrypting the dictionary T. there are three basic choices: encrypting token identifiers, token images, or both. 
Encrvotinq either identifiers ortoken images helps unlink the connection between the identifiers and their token . images, 
in addition encrypting token images protects proprietary token images. In any case, it is desirable to allow val.d access 
to the dictionary only within the rendering process P, while making it computationally difficult to obta.n a copy of the 
entire clear conients of the dictionary. This is possible because in many cases the valid identifiers (e.g., Huftrnan 
codewords) are only a very small subset of all binary strings of up to a certain length, and consequently any exhaustive 
identifier search will not be efficient. 

T01 321 More formally, given the dictionary T and the Lookup subroutine that accesses it. the requirement on encrypting 
the dictionary is that the encrypted dictionary T" and tne corresponding subroutine Lookup' satisfy the following con- 
straints: 

(1) For any encrypted identifier E k (id), Lookup'CT.E^id)) = E k (Lookup(T.id)) and 

(2) Given T and Lookup', it is computationally infeasible to reconstruct T. 

101331 For an encryption scheme S, T and Lookup' can oe constructed as follows. Lei ID be the set ol all syntactically 
possible identihers: in nanicular, ID'S ID, where IP" = M I fid.ti £ T). Lei h be a chp-wsv hash function whose doma.r. 
i: if. . "I feirf rypifcc: lofcfct. clk.iioi.Ki - '. * nmv;' it lie- : ■> . • ■< i - tl.Oc^.L,.!., , : ■■ 

inserted into T. The modilieo subroutine Lookup' uses me algorithm: 
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Lookup'CT4d) 
{ 

id' = h(id) 

{ = LookupfT 4dO 

return (0 

} 



when the Huffman code * used to co p ^ ^ sjmp(y usjng g enc ^p tlon 

XS'Z^«^& (M*E^ .nd .h. Lookup' »«— » 

process is given shown below. 



Render(D) 

I 
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fori.= 1 toP do 

{ 

s 

Load U into memory 
x = E k (*>) 

1Q y = Ek(yo) 

fork= 1 toILIdo 

{ 

15 x = x©x[k] 

y=y©yM 



so 



25 } 

} 

Draw'(x,y,t) 
{ 
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t = Lookup'(T4d[k]) 

Draw / (x,y,t) 

} 



x = Dk.i(x) 
y=D*.i(y) 
t = Dk.i(t) 
Draw(x,y,t) 



} 



During the process, all the coordinate and token image information remains encrypted before calling the subroutine 
Draw'fx.y.t). This is possible for the coordinate information because the encryption scheme is additive. Consequently, 
the content protection level and rendering process performance of the rendering process rely on the security strength 

45 and computational complexity of the scheme used. 

[0138] in another emDodimeni ol the invention, a dipiiai work is. polarized enabling trusted rendering or replay of the 
diciital work wiinoui depolarization ol Ihe diaiial content or iiv presentation del*. In thir embodiment the digital work 

it u,e type- which mciucjt; uiriu .. u i.u-m bug retwi.* ii.u ( ;mi i i b\\*x 1. 1 \ : u:r„ c omext). Resource inlormatioi . 

includes formaning inlormatidn or otner information used oy a replay or rendering application to convert the digital work 

50 into presentation data. 

[0139J Polarization is a type of transformation which renders the original content unreadable or unusable. For a digrtal 
work w, a polarization scheme T, which uses a seed s, generates a polarized digital work w 1 according to: w* «= T(w, s). 
The same transformation T may also be used to generate the polarized resource information S' according to S' - T(S, 
s). In this example, a seed s is used to make reverse engineering of the polarization scheme more difficult. 
55 [0140] For example, a document type digital work may be polarized using a simple polarization scheme. In a docu- 
ment, the digrtal content comprises a series of characters in a particular order or location. If the document is to be 
displayed on a viewing device, each character must be able to be displayed at a particular location for viewing by a 
user on the viewing device, such as on a monitor. A coordinate system is required for displaying each character on 
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the monitor so each character in the document can be displayed on the monitor. The ^f^^^ 
InfoZtLn which is referenced by the monitor's coordinate system. For example, in this paragraph, the letter P 
aDDears at the top line, indented by five spaces. 

SET A simple polarization scheme for Jumbling the text of the above paragraph to to translate the taato^ 
ene s with Sect to the coordinate system. Each letter in the paragraph has an (x,y) location. Suppose the local on 
^SSm^n the above paragraph are polarized using a seed (a,b) from a user's system. The following polar- 
ization functions may be used to polarize the above paragraph: 



Y ss D*, for the vertical axis; and 
10 X = x/a, for the horizontal axis. 



45 



[0142] in this example, the user's device coordinate system must be polarized in order tor the ^^S^k 
transform thedigital content into presentation data, i.e., display the paragraph on the monrtor <*cn«taMTta usert 
deviceToord Je system must be polarized using the same seed (a. b) to generate a potenzed coordinate system. 
The following transformation functions are used to compute both x and y locations of a given point. 



5J 



V= logbCO, for the vertical axis; and 
X= aX, for the horizontal axis, 



oiven bv (X Y) = (x/a, bv). This value is then applied to the device coordinate system (X,V) = (log b (Y) aX) -<x,y). Thus 
the c^erf location^ "P is displayed on the user's monrtor. In both cases of polarization, the polanzed forms of the 
ZZ T£H* and the digital work maintain an inherent association. These ^^^^Zl 
the resource information and the digital work result in the basis for a effective mechanism to protect he digrta. work. 
Whill Z I replay application is able to display the polarized digital work, it is only wrth the polarized system context that 

^SESZZtgZ T25^SZ~* as encryption, depending on the sens^o, 
SZ dtLlwork to be protected different levels of polarization can be used. A sensithre work may require a hgh level 
2 DoSSaSr a lower valued work may require a weaker type of polarization. If the user's environment « trusted, a 

™y be used - An advama9e 10 us,n9 a lower ,evel oi pM T- *< T x LTSSS 

y^em reso xes to create the polarized digital work and to render or replay the po.arized digital worfcTh typa and 
□ualTof the polarization seed may also be used in combination with the polarization scheme to Mnmtti level 
and uengm of the polarization. For example, a more complex polarization seed (such as one containing authorization 
S3 i from a trusted source or a dynamic seed) will provide a higher level of polanzation and strength. 

or or to distribution to the user or customer using a polarization scheme chosen by the manufacturer or distributor 
^^ZZm to be polarized may also be presented in advance to delivery. Preferably a seed • used for 
^rSSoTSi... Lo preferabV. the seed is generated using information provided by the user's system 



40 context. 



Tm When a user purchases a digital work, the user preferably provides information from the user system in which 
me user MM to replay the digital work. This information may be used to generate the polarization ^ed for both the 
Llied dotal work and the polarized resource information (sometimes called the polarized system context). Then 
m?lSiz?d tZ worv and polarized system context or po.arized resource information are provided to the user. 

P cf„S ma r be encrypted prior to distribution to the user. Decryption of both the polanzed digital work and 
sSem coS may be reould prior to replay of the solarized digital work into presentation data, depending on the 

encryption scneme uset ^ ^ ^ ^ ( ^ ^ ; „„, , ,. • : , ( „ , m. pe,*ra»,„ , 

, nc lfa ,.iai.o.'.otmeo. 9 ii8iworkanc.pc i &rizai.onoitneresou.ceimormat.on.OncethepolarBaiion 
Td «« h it. The polarization engine takes as input the dig^ work or 

he ?eso^ f orrnation. and generates the po.arized form of the digita. work or the 

he transformation function seeded with the polarization seed. During replay of the polarized digital work, he polanzed 
resource So is utilized to generate the presentation data and/or image data. The same or different polanzation 
information functions can be used for the digital work and the resource information. 

jTHml bating a polarized digital work is shown with reference to Figure 14. A digital work 1410 
Tdes Sig a. con,l°nt and a se, S resource information used for formatting and rendering the «£ eoMe* jjo. 
onm usable orviewable by a user. The digital work 1410 goesthrough a process of content polanzation 1420 in which 
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the digital content is polarized and the resource information is preserved, creating polarized digital work 1422. The 
content polarization 1420 may occur as shown with reference to Figure 9. A digital work typically includes content, 
instructions and formatting. While polarization can occur to the entire digital work, preferably only the content is polar- 
ized; the instructions and formatting are not polarized. However in some instances, for some replay applications, some 
5 of the resource information contained within the digital work may also be polarized. This is similar for the format pre- 
serving encryption method described above. 

[0149] Resource extraction 1412 extracts at least one resource information from the set of resource information 
associated with digital work 1410. Extraction consists of copying the resource information into a system resource file 
1414. System resource 1414 is then polarized at resource polarization 1416 to become polarized system resource 
io 1424. The polarization scheme for content polarization and resource polarization need not be the same. Preferably, 
each polarization scheme employs a polarization seed 1418 which is generated by seed generator 1426. Several 
exemplary methods for seed generation are described below. In particular, in a preferred embodiment, the polarization 
seed is based on unique information from the user's system. 

[01 50] Several techniques for generation of the polarization seed may be used. For example, a seed generator which 
15 generates a number from a random number generator may be used. This method, referred to as stateless polarization, 
does not depend on any secret key information and user system information. The process for stateless polarization 
yields a specific value for the system for polarization. The inherent vulnerability for digital security systems may be 
found in mishandling secret information, mathematical complexity, and algorithmic complexity. Eliminating the secret 
information seals off one target of attack. With stateless polarization, a random number generator produces the polar- 
20 ization seed. In this case, once the polarization process is complete the seed is discarded without a trace. Hence, the 
security of the system is free from attack focused on compromising the secret information, and the user need not 
divulge sensitive information that may be deemed a privacy violation. 

[0151] Another seed generator that may be used is a state-based generator. The state-based seed generator con- 
structs a seed by first acquiring system state information from the user's replay system or rendering device. System 

25 state information includes hardware identifiers, system settings and other system state-related information. While there 
is much value in stateless polarization, other security requirements may require use of an inseparable linkto a particular 
user system or device. By generating the polarization seed from system/device-specific information, the polarization 
engine will produce a digital work that is polarized to a form lhat corresponds to a specific system/device. 
[0152] The polarization seed generator can also be tied to an authorization process. In authorization-based polari- 

30 zetion. the seed generation can be tie in with the outcome of the authorization process. A separate authorization 
repository (which is a trusted source) provide authorization information as part of some other security feature associated 
with delivering access to a digital work to a user. The trusted source of authorization information may be an online 
authorization repository as described in US Patent No. 5,629,980. This authorization information is then used to gen- 
erate a polarization seed. 

3t> [0153] If a stateless polarization seed is used, the digital work and its resource information may be polarized and 
stored together for delivery to a user when a user purchases the associated rights of use for the particular digital work. 
If one of the other polarization seed generation methods is used, polarization typically must wait until the user provides 
the system state or authorization information before the digital work and resource information may be polarized. 
[0154] An embodiment which provides a higher level of protection in terms of ensuring that the digital work may be 

40 replayed only on a specific physical system or device uses a dynamic state-based polarization seed. In this embodi- 
ment, a polarization engine and polarization seed generator must be provided to the replay application or rendering 
device along with the digital work and resource information. In this embodiment, the digital work and resource infor- 
mation are polarized prior to replay and rendering using a seed which is generated based on the dynamic state of the 
particular system or device. The dynamic state may come, for example, from the system clock, CPU utilization, hard 

45 drive allocation, cursor coordinates, etc. By polarizing the work using a snapshot of a dynamic state, the work is locked 
to a particular system configuration (i.e., state) in time. Polarization of the digital work : and ultimately its blind replay 
(described Delow) s is fcaseo upon a Dynamically evolvinp siait*. The evoiuiion of ihe dynamic state does not yield unique 
: i-crfri iiik»in-,£iiot. thc.i yjw ^i n-.r iv.va.tvilii'j r=! tin*. (.uiLr.L«..s = ■ : : :..i.f i.Mii't e :; n.-mic- slate based polarizatior 
makes compromising tne polarized digital work and system ccniext more ditliculi. Since the polarization process is 

so carried out within a trusted system, it is implied that the process can not be deconstructed. 

[0155] The actual process of polarization can be, as described in the example above, an algorithmic-based trans- 
formation -parameterized by the polarization seed. During polarization, the data and resource identifiers of the digital 
work are transformed as described above. The structure of the digital work is unaltered, however, such that the original 
format, such as PDF, DOC, WAV, or other format, is retained much like in the format preserving encryption. Similarly 

55 the polarization of the resource information yields a polarized form of the resource information such that the resource 
identifiers, element identifiers and resource characteristics are transformed, yet the structure of the system context 
remains unaltered. By polarizing the digital work and resource information according to the same seed based on a 
user's specific device or system information, an inseparable relationship is established such that the work cannot be 
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replayed to its clear form with any other device or user system. If circulated in an unauthorized manner, the protection 

mi56] S Duringblind replay, the unique characteristics of the polarized resource information enable the replay appli- 
cation to properly replay the polarized digital work and generate unpolarized or clear presentation data. Because the 
dioital work and the resource information were transformed in a complementary manner, the polarized elements of the 
digital work such as the resource identifiers and data, unknowingly reference the complementary elements within the 
resources of the system context. Due to the matching transformation the proper elements within the context are iden- 
tified by the replay application such that the resultant presentation data appears in the clear. Hence, the work is pro- 
tected until the last possible moment after replay. • 
f0157l As discussed earlier, the conventional distribution of digital works via the web is relatively straightforward. 
The work is created using an editor, posted to a web site, accessed by the user audience and replayed in a viewer or 
on a display system If a content owner does not desire to protect his/her digital work (or if the content owner trusts all 
users who will receive the work), the digital work is provided "in the clear i.e.. without any encoding, encryption or 
other protection for direct use by any user. 

td 581 If the dioital work is downloaded onto the user's system. It is typically stored in memory. « the digital work is 
provided via a storage media, such as floppy disk or CD-ROM or DVD-ROM, the digital work is usually accessed 

foS 'TortenfplTy the digital work, referring to Figure 15, the digital work 1510 is provided to a replay application 
1512 In the case of a document or other type of digital work which requires formatting information or resource infor- 
20 mation the digital work will include digital content plus resource information setting forth the particular system context 
or system resources needed by the replay application to process the digital content. For example, the digital work 1 510 
may be a text document in which the text is displayed using the Arial font. When replay application 1512 accesses 
resource information on digital work 1510 indicating Arialfont is used, it accesses the a PP ro P ria, ^y s, ^. m ; e l s^>u, ^ 
1516 (which in this case is the Arial font table) and uses the system resource information to convert the digital content 
25 into presentation data 1514. . 

f01601 In some replay applications, converting the digital content into presentation data is sufficient for use by the 
Uer In others presentation data is only an intermediate form which musi be further converted. For example, in the 
case of a display system 1524 which is a printer, the presentation data 1514 must be further rendered by rendering 
application 1518. Rendering application 1518 may be a decomposer within the printer. Rendering application 1518 
u<e« other system resources 1516 to transform the presentation data 1514 into image data 1520. Image data 1520 is 
in a form which can be directly displayed on display device 1522 (in the case of a printer output as a printed docunwrt) 
101611 In addition to the earlier described systems and methods for protecting a digital work during replay, a digital 
work mav be protected during replay by polarizing the digital work in accordance with a first polarization scheme which 
produce* polarized content and preserves the digital work's resource inf ormalion. Aportion of the digital work's resource 
Information is copied and polarized in accordance with a second polarization scheme. Referring to Figure 16 reptay 
application 1612 uses the polarized resource information 1614 (and any other system resource informahon 1616 that 
mav be required) to transform Ihe polarized digital work1610 into clear presentation data 1618. Presentation data is 
necessarily in the clear, which means it can be captured by other programs (such as a screen capture utrttty program). 
H^eMhe outpul of such other programs is not in the same format and frequently not of the same f.delrty as the 

40 original dptal "JjJ^ resource inf orma tion can be thoughl of as acting like a polarizing filter to bring the polarized 
dioital content into a clear image (presentation data). This system is a blind replay system in that the replay application, 
which can be any commercial application, does not know or need to know the clear digital content. Blind replay operates 
tor any transformation function R, such that R(w'.s*) = R(w.s). where w' is the polarized digital content w ,s the clea 
dioital content s' is the polarized resource information and s is the unpolarized resource information. Blind replay of 
poianzed digital works using polarized resource information is diff ereni from blind transformation described above in 
mat Blind .eplay produces clear p.eseniation data without having to depolarize it. In Wind translocation, the replay 
apol.ca.ion convene the enciyn.ed dioital work into encrypted presem E .,on data, which murt men he decrypted. In both 

ir'.: II.. IBMW+! MJIH HKIIif.lU(«'l r '' <11 II...I.-J K'li. 

101631 Bl.no repiay (also called one renoering) using a polarized digital work and polarized resource intormation 
can be used alone to protect the digital work during replay as well as in addition to regular encryption. For example, 
the polarized digital work and polarized resource information may be encrypted to protect it during distribution .then 
decrypted at the user's system into the polarized digital work and polarized resource information. The user must firs 
obtain permission from the content owner or the distributor acting on behafl of the content owner (in order to decrypt 
the encrypted digital work). Once the user is qualified, the encrypted polarized digital work and the encrypted polarized 
resource information are decrypted and the polarized digital work is replayed in the replay application using the polar- 
ized resource information. 

[0164] The complexity of rendering a digital work into a usable form for viewing by a user can be used to further 
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protect the digital work during replay. Referring to Figure 17, po la ri2ed digital work 1710 is provided to replay application 
1712, which uses polarized system resources 1716 and other system resources 1718 to transform polarized digital 
work1710into partially polarized presentation data 1714. In this embodiment, display system 1728 is needed to trans- 
form presentation data into a form usable by the user. Partially polarized presentation data 171 4 is provided to rendering 
application 1720 which uses polarized system resources 1716, local system resources J 722 and system resources 
171 8 to transform the partially polarized presentation data 1714 into clear image data 1724. Clear image data 1724 is 
then displayed on display device 1726 for use by the user. In this embodiment, presentation data is still polarized, 
taking the location of the clear data to a later point of the display process and providing further protection. 
[0165] To enhance usability of the system for polarization of digital works, the polarized resource information may 
be separated from the digital work and tied to a transportable device such as a smart card. In this embodiment, the 
replay application 1 71 2 plays back the work using the polarized system resources 1716. Instead of having the polarized 
system resources 1716 stored in a local memory, along with the polarized digital work, 1710, the polarized system 
resources 1716 is stored in a transportable device such as a smart card. Also, the smart card, possibly with hardware- 
enhanced features, may possess attributes that provide for tamper resistance. Within the transportable context, the 
polarized data is processed by the replay application 1712 to yield the partially polarized presentation data and then 
provided to the rendering application 1 720. 

[0166] Many different types of digital works can be protected throughout use using the polarization method. For 
example if the digital work is a document or text file, the replay application may be a word processor, system resources 
or resource information may include font tables, page layout, and color tables. If the digital work is audio or video data 
(e.g., streams), the replay application may be an audio or video player. The presentation data will be the audio/video 
final data stream. The display system may be an audio/video device. The rendering application may be the audio/video 
device driver. The image cata may be the audio/video device cata stream and the display device may be the audio/ 
video rendering device (speaker or monitor, for example). 

[0167] For a digital work that is an audio/video data stream, the system resources or resource information may 
include characteristics of the audio/video device: sample rate (samples per second - e.g., 8 kHz, 44.1kHz), sample 
quality (bits per sample - e.g., 8, 16); sample type (number of channels - e.g., 1 for mono, 2 tor stereo), and sample 
format (instructions and data blocks). A table of some audio/video data streams and their corresponding resource 
information or variable parameters which can be selected for polarization is set forth below: 

Tabid: 



Digital Work: A/V Data (Streams) 




Extension 


Origin 


Variable Parameters (# Fixed) 


Compression 


Player 


.mp3 


MPEG standard 


sample rate, quality, tftype 


MPEG 


MP3 Player 


.ra 


Real Networks 


sample rate, quality, fltype 


Plug-ins 


Real Player 


.wav 


Microsoft 


sample rate, quality, tftype 


ADPCM 


Window Media 


.snd 


Apple 


sample rate, Equality, fttype 


MACE 


QuickTime 



[0168] The structure of a digital work can be used advantageously for polarization. While it is possible to polarize 
the entire digital work, it is more convenient to polarize only a portion of the digital work. Most digital works include 
three primary elements: instructions, data, and resources. Preferably, only the data and resources of the digital work 
are polarized, much like the format preserving encryption method described above. By selectively transforming only 
the data and resources, a digital work may be transformed such that the content remains in the original format, yet the 
data and resources are incomprehensible. 

|016S] The general layout ol a digital work of the document type is shown in Figure i£. In Figure 16. digital work 150 
include f-snf Dtrcripic-i U : . C rnirrlf com ij / if-t p.r.f iff hnt ■iirw- loc-i.iiiv • V.\ uirT«Mr. lfiO fine! if-4. II* 
kage Descriptors i5k define Jiifc yeneral layout oi l work, roi insianct , the pa^ viiu : pagt nurnucri. anc margins lali 
into the category of Page Descriptors with respect to digital oocuments. Control Codes 154, 158 and 162 are similar 
in that they describe the presentation of the content. Examples include commands to set text position, output text, set 
font type, and set current screen coordinates. Resource Identifiers 156 simply reference the desired resources. In the 
digital document realm, resources could vary from font typeface to background color. Finally, Data 160, 164 represent 
the core information communicated by the digital work. This could be the drawing coordinates used in a multimedia 
clip or the character codes for rendering as a digital document. 

[01 70] An example of a digital work (in this case a simple digital document) and one of its polarized forms are shown 
in Figures 1 9 and 20, an HTML document in clear and polarized form. The tags <html> and <body> are Page Descrip- 
tors The <font>...<Vont> tag is an example of a Control Code for setting font resource characteristics, while "AriaP 
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1S 



20 



and "14- are Resource Identifiers for an Arial typeface. 14 point font. The 'Hello World" text is the Data, or the core 
information of the work. The <p> is another Control Code to signalthe beginning of the paragraph. Finally, the document 
is closed out with Page Descriptors <\body> and <\html> to identify the end of the document. 
101711 Fioure 20 shows what the digital work of Figure 1 9 looks like in a polarized form. It can be seen that the Page 
5 Descriptor and Control Code tags remain unaltered; the <html>, <body> and <font> tags are unchanged. Whereas, 
the Surce Identifiers, "Aria." and "14", have been transformed to indecipherabte values. Similar* the Date -Hello 
World" has also been transformed to an indecipherable value. By transforming the Resource Identihers end the Data 
the content is rendered meaningless while in the polarized form. Yet, the fact that the Page Debtors and Control 
Codes remain intact allows torthe document to retain its original format, whch m general could be HTML. Adobe PDF. 

Astern resourS arable to a replay application on a particular system. For example, ft may include the Font Table. 
c£ PaSt? System Coordinates and Volume Setting. When a digfta. work is input to a replay applica tion the . rephjy 
apSon usesthe particular resource information contained within the digital work to transform the d.grta. contetf 
fnto SesentaSn data Each system context or resource information contained within a digital work .s or can be arte ed 
o b u"que o a system tor which « can be replayed. The system context is a required element for he use of the 
23 tying use of the digital work to a specrfic system or physical device or replay apphcat on fo r refja* The 
Resource Identifiers and Data within the digital work may either directry or indirectly reference elements contained 
wShintS ystemcontext. Polarizing. he digHal work and system context enable blind rendering. nto dear pres« 
Ta By pirizing the system context with a polarization seed that is tied to a unique system the resu*.ng polarized 

with the same polarization seed, may be accessed and replayed. M~*tn~ 
mm Rgure21 illustrates a typical configuration of ^ 

ResID) element identifier (ElemID). and resource characteristics (Characteristics). The ResID .ncludes pertment im 

wZ Ihe resource Finalty, the Characteristics are the actual resource characterise used to express the .nd-vdua. 

W% C °%™% is an illustration of the resource for the font table pertaining, o the Ariel t^^SZS 
dlnSer in this case is the font name. "Arial". Following the ASCII convention, the number 48 .dentrf.es the .ndw.dual 
so ^ZeZ^teM*, The resource element characterise torthe ElemID representee information to express 

miirFigure 23 is an illustration of the polarized the system context for the font resource shown in Figure ,2* The 

1 identifier itself is transformed to "k1 3K2". The element identifier rfserf need not be tra <*^« ' 
enough to transform the resource characteristics alone. In this case, "48" .s dep.cted as transformed to express the 

35 EE - ?^^^ ™* be used ,or many dB ™ rt types H 0, K di9rta, TT-ZZSZSZ 

mentV polarization and blind rendering may be usedfor audio/video data. As noted above, aud.o/v.deodata « generaMy 

SZSSSSZL of streams. A replay application is the audio/viaeo player whteh transform* ; the dj-ta 

stream into a final data stream which can be processed by a transducer (speaker) into an aud.o output or by a d.splay 

bv «?mplmg the audio/video input streams 1710 at some sample rate, quality and type accepted by a target aud«rf 
vtoeo devSfe It uses the audioAvideo system resources to sample, mix and produce audio/video «mrtM 
m*es the resampled audio/video streams to produce a final audio/video stream in a format expected by the target 
«. device t \Z 'caTof an audio/video player, the presentation data 1714 n the final mixed audio/video stream at some 
samDle rate quality, type and format expected by a target audio/video device. 

£wf The To* audio/video dev.ce (e.g., rendering appl.cetion -,720) is some hardware system the, ,s abteto 
onven the audio/video stream (presentation data 1714) a, a specil.c samp., rate, quality, type (channel) and t ormat 
mmc PAl orNTfO.r ,., r.u.i.wvklee datr ;7U i : «.* h ...r.U.df. soundcards. speaker^ 

,. rnonnor, a.,o the d.gi.e. ,o ana.o, .u.vene. .ocateo with.., u.e t uu.u,v,oeo o«, Many oevces a.e able ^ .*jy aud.0/ 
vioec sueams a, a'ange of ditte.en, sample rates. Image data 1724 (e.g. an audio signal or a v.deo .mage stream) * 
Generated by the audio/video device driver 1 720 and "consumed" by the display dev.ce 1 726. 
mm Fo. example, to polarize an audio/video data stream, it may be split into two or more separate streams. One 
ueam .s po.a zed a d one stream is unpo.arized. Each stream may have different device charactenst.es (resource 
55 n Za tort sample rates, channels, qualities and/or formats associated with it. The device charactenst.es (one or 

[oi SOP etdTepty of the polarized audio/video stream is accomplished in a similar manner as for a polarized digital 
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document. The replay application (audio/video player) mixes together the unpolarized stream and the polarized stream, 
and using the polarized resource information, produces a polarized final data stream for the target audio/video device 
with a correct set of resource information. The target device (1 720) uses the polarized resource information to play the 
polarized data stream generating clear sound/visual effects (1 724). I 

[0181] While certain exemplary embodiments of the invention have been described in detail above, it should be 
recognized that other forms, alternatives, modifications, versions and variations of the invention are equally operative 
and would be apparent to those skilled in the art. The disclosure is not intended to limit the invention to any particular 
embodiment, and is intended to embrace all such forms, alternatives, modifications, versions and variations. For ex- 
ample, the portions of the invention described above that are described as software components could be implemented 
as hardware. Moreover while certain functional blocks are described herein as separate and independent from each 
other, these functional blocks can be consolidated and performed on a single general-purpose computer, or further 
broken down into sub-functions as recognized in the art. Accordingly, the true scope of the invention is intended to 
cover all alternatives, modifications, and equivalents and should be determined with reference to the claims set forth 
below. 



Claims 

1 . A method of protecting a digital work, z, during transformation by a transformation function, F, into presentation 
data F(z), comprising: 

encrypting the digital work, z, in accordance with an encryption scheme, E; 

using a blind transformation function P to transform the encrypted digital work E(z) into encrypted presentation 
data, F'(E(z)), wherein P is a function of F; and 

decrypting the encrypted presentation data, P(E(z)), in accordance with a decryption function, D, to obtain 
the presentation data, F(z), wherein D(P(E(z)) = F(z). 

2. The method of claim 1 , wherein the encryption scheme E is a format preserving encryption scheme. 

3. The method of claim 1 . wherein the encryption function E is an additive encryption scheme and wherein P = F. 

4. The method of claim 3. wherein the additive encryption scheme is selected from the group consisting of Mult, Exp, 
EG, OU, RSA and compositions thereof. 

5. The method of claim 1 , wherein P is a polynomial of F. 

6. A system of protecting a digital work, z, during transformation by a transformation function, F. into presentation 
data F(z), comprising: 

an encryption engine lor encrypting the digital work z in accordance with an encryption scheme, E; 

a blind transformation function P for transforming the encrypted digital work E(z) into encrypted presentation 

data, P(E(z)), wherein F is a function of F; and 

a decryption engine tor decrypting the encrypted presentation data, P(E(z)), in accordance with a decryption 
function, D, to obtain the presentation data, F(z), wherein D(P(E(z)) = F(z). 

7. The system ol claim 6, wherein the encryption scheme E is a tormat preserving encryption scheme. 

: mt fvsiMT.olc.teir.. « . *-i.*u.ii.im encryption h:r..-.iu :. . i: wiiiiv c ;.c. vpm-i. f cheme and wherein P = C 

9. The system of claim 8, wherein the additive encryption scheme is selected from the group consisting of Mult, Exp, 
EG, OU, RSA and compositions thereof. 

10. The system of dairn 6, wherein P is a polynomial of F. 
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